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ABSTRACT 

We consider a MIMO interference channel in which the 
transmitters and receivers operate in frequency-division du¬ 
plex mode. In this setting, interference management through 
coordinated transceiver design necessitates channel state in¬ 
formation at the transmitters (CSI-T). The acquisition of CSI- 
T is done through feedback from the receivers, which entitles 
a loss in degrees of freedom, due to training and feedback. 
This loss increases with the amount of CSI-T. In this work, 
after formulating an overhead model for CSI acquisition at 
the transmitters, we propose a distributed mechanism to find 
for each transmitter a subset of the complete CSI, which is 
used to perform interference management. The mechanism 
is based on many-to-many stable matching. We prove the 
existence of a stable matching and exploit an algorithm to 
reach it. Simulation results show performance improvement 
compared to full and minimal CSI-T. 

1. INTRODUCTION 

Ultra-dense networks have been identified as one of the key 
scenarios in 5G communication systems, characterized by a 
high number of nodes, located in close proximity, in both het¬ 
erogeneous and homogeneous networks [I]. Evidently, coor¬ 
dination among transmitters and receivers is vital in such sce¬ 
narios, where it is well-known that the sum-rate performance 
is limited by unsuppressed interference. Forward-backward 
training algorithms, such as 00. employ uplink and down¬ 
link pilots to estimate the required local channel state infor¬ 
mation (CSI) quantities, and iteratively refine the precoding / 
decoding matrices in a cluster of cooperating nodes. 

It becomes clear at this stage that the overhead associ¬ 
ated with such clusters is a major concern (especially in the 
aforementioned dense networks): this motivates the need for 
schemes that take into account the loss in performance in¬ 
curred by coordination, training and possibly feedback. This 
problem was addressed in |5j where the authors proposed a 
training and analog feedback scheme that maximizes the ef¬ 
fective sum-rate (the achievable sum-rate in the network ac¬ 
counting for the loss in degrees of freedom (DoF) due to over¬ 
head). Furthermore, the issue of user partitioning was tackled 


in jSj where several schemes for partitioning the users into or¬ 
thogonal groups are proposed. 

We propose in this work an overhead-aware framework 
for distributed cooperation in frequency-division-duplex 
(FDD) systems: sets of potentially cooperating transmit¬ 
ters and receivers are formed using many-to-many stable 
matching, where the utilities at both transmitters and re¬ 
ceivers are designed to take into account both performance of 
each link, and the associated overhead for channel estimation 
and feedback. We prove that the formulated model satisfies 
the conditions for the existence of a many-to-many stable 
matching. By utilizing an algorithm to reach a stable match¬ 
ing we provide a distributed mechanism to determine the 
amount of CSI present at each transmitter which is exploited 
for interference management. This being said, any algorithm 
for precoder optimization can now be employed to optimize 
any desired metric (interference leakage, sum-rate, etc.). Fi¬ 
nally, our simulations indicate that our proposed distributed 
scheme offers gains in performance over selected benchmark 
schemes. 

Notations: Column vectors and matrices are given in 
lowercase and uppercase boldface letters, respectively, tr(-), 
|| • || F , and (-)t denote respectively the trace, Frobenius norm, 
and Hermitian transpose. I is an identity matrix. 

2. SYSTEM MODEL 

Consider a set of transmitter-receiver pairs 1C = {1,..., K} 
operating in the same spectral band. The transmitters and re¬ 
ceiver are equipped with multiple antennas such that transmit¬ 
ter j uses Mj antennas and receiver k uses N k . antennas. The 
flat fading channel matrix from a transmitter j to a receiver k 
is Hjk £ C iVfc x M .i . The received signal at a receiver k is 

Vk — 1 V 7j k Hj kV jSj + Zfc, (1) 

where Sj is the transmitted signal vector of dimension dj, 
Vj £ C M :i xd 'i is the transmit precoding matrix at transmit¬ 
ter j, t jk is the pathloss coefficient, and z& ~ CAf(0,o 2 I) 
is additive white Gaussian noise. The transmit precoding ma¬ 
trix V j used at transmitter j is restricted to a power constraint 

such as tr (VjV'-'j < Pj, j £ 1C. 


Given a pre-log factor /3 £ [0,1], determined by the lost 
temporal degrees of freedom due to training and overhead |[5j 
|6), the spectral efficiency of link k is given as 


have been used before, e.g. in |6). Given these assumptions, 
we now count the number of symbol intervals that are needed 
in the CSI acquisition to get a measure of the overhead. 
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® 2.2.1. Phases of CSI Acquisition 


where Z k = a 2 1 + Yhj^k V H jk V is the interfer¬ 

ence and noise covariance matrix. 

2.1. CSI-T Sharing Set 

The precoding matrices are designed based on the current net¬ 
work conditions in the form of CSI. We consider an FDD sys¬ 
tem, meaning that CSI at the receivers (CSI-R) can be ob¬ 
tained by channel training, but due to the non-reciprocal na¬ 
ture of the channel, feedback must be used in order to obtain 
CSI at the transmitters (CSI-T). Both the channel training and 
the CSI feedback lead to overhead, whose impact on the spec¬ 
tral efficiency depends on the channel coherence time. 

In our model, each receiver k has a non-zero channel from 
each transmitter j (cf. ([TJ). The corresponding links are in¬ 
dexed by the index set J = {(j, k ) | (j, k) £ 1C x 1C}. Due 
to the path loss factor 7 j k in Q, it is clear that some of the 
cross-links will interfere more strongly than others. We thus 
define I C J, which will be called a CSI-T sharing set. This 
set specifies what CSI is fed back from the receivers to the 
transmitters, and should thus correlate with which cross-links 
are deemed important to treat in the precoder design. Clearly, 
( k , k) £ I, V k £ 1C. The output of the stable matching 
algorithm in Sec. [3] will be such a CSI-T sharing set. 

The CSI-T sharing set I describes how incomplete the 
CSI-T is. This can range from minimal CSI-T (X = {(k. k ) | 
k £ 1C}), to complete CSI-T (I = J). In certain scenarios, it 
has been shown that the feasibility of interference alignment 
can be retained under some level of incomplete CSI-T |7|. 
Given a 1 C J, we now detail how the CSI acquisition will 
take place, and the corresponding overhead. 

2.2. Overhead Model 

The channel estimation is based on pilot-assisted training and 
the feedback is based on analog feedback 0 - We assume a 
block fading model, where the channel is constant for T sym¬ 
bol intervals (a coherence block). The CSI must be estimated 
once per coherence block. 

We now propose a simple overhead model. For the chan¬ 
nel training, we assume that minimal training is sufficient, 
i.e. that each channel coefficient can be identified using a sin¬ 
gle pilot symbol. For the analog feedback, we assume that 
the minimal number of symbol intervals for orthogonalizing 
the feedback between users is sufficient for acceptable perfor¬ 
mance. Admittedly, these assumptions might be coarse ap¬ 
proximations of proper system design at low signal to noise 
ratio (SNR), but they allow us to clearly compare the over¬ 
head of training and CSI feedback. Similar approximations 


The CSI acquisition has the following phases, which are re¬ 
peated in each coherence block. 

T1 Downlink and uplink channel training. In this phase, 
the receivers acquire CSI-R which will be fed back to the 
transmitters during the F phase, as well as being used for 
formulating the receiver utilities in the SM phase. To en¬ 
able succesful decoding in the F phase, the uplink chan¬ 
nels are also estimated. For pilot orthogonality reasons, 
this phase requires Ylk=i M k + N k symbol intervals. 

SM Stable matching. This phase determines the CSI-T shar¬ 
ing set I using the stable matching algorithm in Sec. [3] 
This phase requires Lsm symbol intervals of communica¬ 
tion, which will be quantified in Sec. [3] 

F Analog CSI feedback. In this phase, the transmitters ac¬ 
quire the CSI-T needed for the precoder design. Given a 
CSI-T sharing set I C J. receiver k feeds back the CSI 
for j £ {i | (*, k) £ X} to all transmitters * £ 1C. As¬ 
suming analog feedback, distributed processing [5] Sec. 
III.A] and N k < Mj , for all k. j £ 1C, this phase re¬ 
quires k)ex^J symbol intervals. For further details 
of the feedback mechanics and the resulting symbol inter¬ 
val overhead, see |5j Sec. III.A]. 

T2 Downlink effective channel training. After precoder op¬ 
timization, the resulting effective channels (i.e. channel 
matrices multiplied by precoders) are estimated. This re¬ 
quires | d k symbol intervals. 

Summing up the number of symbol intervals needed for train¬ 
ing and feedback, the CSI acquisition overhead is 

L csi = J2 k=1 ( M fc + N k + d k ) + X) (j>fc)eI M r ( 3 ) 

Note that the first term in the summation (accumulated train¬ 
ing overhead) is linear in K, whereas the second term (feed¬ 
back overhead) in the worst case becomes quadratic in K. 
Hence, in terms of overhead reduction, there are large gains 
to be anticipated by reducing the amount of feedback. 

After the five phases of CSI acquisition, data transmission 
takes place during the remaining L dMd = T — Lcsi — Tsm 
symbol intervals. Then, the pre-log factor in Q is 

P = L da jT = 1 - (L csl + L su )/T. (4) 

It is now clear that optimizing R k in (|2) becomes a trade¬ 
off between better interference management (higher spectral 
efficiency factor) and lower overhead (higher pre-log factor). 
Alhough the above method is inherently distributed, it enables 
transmitters within the CSI-T sharing set to have “global CSI- 
T” and thus perform the precoding in a centralized fashion. 




Note however that the complexity of exhaustively searching 
all possible CSI-T sharing sets is 2 K ~ K . Next, we detail an 
efficient and distributed procedure based on stable matching 
which will determine the CSI-T sharing set. 

3. DISTRIBUTED CSI SELECTION 

Many-to-many stable matching has been of interest for its ap¬ 
plication in the job matching problem [|8j. There, a set of 
firms and a set of workers exist, where each firm has a set 
of vacant positions to offer to workers, and each worker can 
work at more than one firm. The interest of a firm is to hire 
the best workers and each worker’s interest is to work at the 
most preferred combination of firms. The solution of the job 
matching problem is a many-to-many stable matching. 

The job matching problem relates to the problem in our 
setting in the following. First, we seek a matching between 
the set of transmitters (firms) and the receivers (workers) 
which dictates the CSI-T sharing set. Second, the stability of 
the solution concept supports a distributed implementation of 
the mechanism. 

3.1. Stable Matching Model 

Consider the set of transmitters T = {tx i, ..., tx ^} and the 
set of receivers TZ = {rx 1 ,, txk}- A matching between 
the two sets is defined as follows. 

Definition 1. A matching A4 is a correspondence from the 
set TUU to the set of all subsets off UlZ and satisfies the 
following properties for j £ f and k £ 1Z: 

1. M(j) £ IZuU), andM(k) £ TU0, 

2. k £ M.(j) if and only if j £ A4(k). 

The matching Ad is a set valued function such that fii(j) 
is the set of receivers matched to transmitter j £ f and Ad(fc) 
is the set of transmitters matched to receiver k £ TZ. If a trans¬ 
mitter j is unmatched, then A4(j) = 0. Similarly, Ad(fc) = 0 
means that receiver k is unmatched. Condition 2. in Defini¬ 
tion [T| ensures that whenever a transmitter j is matched to a 
receiver k then k would be also matched to transmitter j. 

3.1.1. Receiver and Transmitter Preference Sets 

Each receiver k £ TZ must have a strict, transitive and com¬ 
plete preference relation over the set 2 r containing all subsets 
of T. For a given set of transmitters B C f, a receiver k is 
able to select the most preferred subset of B by solving: 

Cfe X ($) = argmax d> r k x {j) s.t. |<S| < q™, (5) 

SgB 


and (f £ N is called the matching quota of receiver k. The 
function in ([6j» reflects a measure on how much the channel 
from transmitter j is aligned to the direct channel. This model 
shares similarities with the utility functions formulated in 0 
in the context of cognitive radio. If this measure is large, 
then the corresponding transmitter j could potentially gener¬ 
ate substantial amount of interference at receiver fc, and hence 
an effort in terms of CSI feedback and precoding should be 
taken to manage it. An unmatched receiver has zero utility. 

In (|5j, a receiver k prefers to be matched with the trans¬ 
mitters which maximize the sum of the measures defined in 
0- with the constraint that the total number of matched trans¬ 
mitters is not more than a design integer q'f £ N. Observe, 
that Problem {5} can be solved with low complexity by a 
greedy method since the objective functions are additively 

separable 03- 

Similarly, each transmitter j must have a strict, transitive 
and complete preference relation over the set 2 R of all subsets 
of TZ. Given a set of receivers B C TZ, we define the subset of 
B which transmitter j prefers the most as 

Cf(B)= argmax V </>f(fc) s.t. |S| < qf, (7) 
S&B 

where 

if k£lZ 

y ’ ]0 if fc = 0 ^ 

and q tx £ N is the quota of transmitter j. The function in ([Sj 
for a transmitter j is increasing in the strength of the channel 
to a receiver k and decreasing with the number of antennas. 
Since the number of antennas is proportional to the channel 
feedback overhead for CSI-T acquisition, the ratio in <[8]» re¬ 
flects the amount of interference relative to overhead. If this 
measure is high, then a matching between transmitter j and 
receiver k is desirable. 

3.1.2. Many-to-many Stable Matching 

The first of two requirements for stability is the following. 

Definition 2. Matching A4 is individually rational if 

1. no transmitter j £ f exists with M. (j) C tx (yV! (j)), 

2. no receiver k £ TZ exists with M.(k) C 7 k x {M.(k)). 

Condition 1. says that a matching M is not individu¬ 
ally rational for a transmitter j if the set of receivers M(j) 
matched to transmitter j are not all within the solution of the 
optimization in ([7]) with A4 (j ) as input. Analogously condi¬ 
tion 2. for a receiver k. 


where the functions (ffif (j ) are defined as 


y/7jk\\H] k H kk \\ F 
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Definition 3. Matching A4 is pairwise stable if there does not 
exist a pair (k, j) £ TZ x T such that 
1. 

2. k £ Cj x (M(j) U {fc}) and j £ C]f {M{k) U {j}) 


4> r k(j) 


o 


if j e T 
if j = 0 



Pairwise stability requires that there exist no receiver k 
and no transmitter j which are not matched to each other but 
prefer a matching between themselves. The two conditions in 
the second requirement mean, respectively, that transmitter j 
is in the solution set of receiver k’s optimization problem in 
0 given the set M. (fc)U {j} as input, and receiver k would be 
in the solution set of the optimization problem of transmitter 
j in 0 with A i(j) U {k} as input. 

Definition 4. A matching A4 is stable if it is individually ra¬ 
tional (Definition [2]) and pairwise stable (Definition Q. 

Generally, a many-to-many stable matching need not ex¬ 
ist. We prove this issue positively in the following. 

Theorem 1. A stable matching exists in our setting. 

Proof. A sufficient condition for the existence of a stable 
matching (Definition [4| is when the preferences of the trans¬ 
mitters and receivers satisfy the gross substitute property |8j. 
Our formulation of the utility functions satisfy this property 
according to ED Section 2] since ([5]) and <[7j correspond to 
the q-satiation of the performance. □ 

A stable matching in our setting depends strongly on the 
choice of quotas for each transmitter and receiver, incorpo¬ 
rated in problems 0 and 0. Efficient modeling for the quo¬ 
tas seems to be a hard task which will depend on several pa¬ 
rameters in the considered scenario. For a symmetric system, 
it is possible to determine the quotas as a function of required 
pre-log factor. 

Theorem 2. Assume a symmetric network in which M k = 
M,Nj = N,dk = d for all k £ 1C. In addition let q k x = 
= 9 f or a U k C 1C, then for a given minimum desired 
pre-log factor f3 £ ^rnm^maxj w f lere 

pmin = 1 _ K ( 2M + N + d ) j T ^ ( 9 ) 

pmax = x _ X(KM + M + N + d)/T, (10) 

the following condition should hold 

q<lT(l-$)/KM-l-(N + d)/M\. (11) 

Proof. First observe that since each transmitter and receiver 
has the same quota q, each transmitter (receiver) will be 
matched with q other receivers (transmitters) in a stable 
matching. This fact follows from the individual rationality 
assumption (Definition [2]). Consequently, the analog CSI 
feedback overhead is equal to qKM. Using Q we can for¬ 
mulate the condition on the quota q in (IT) . The bounds on f3 
correspond to minimal and full CSI-T sharing sets. □ 

3.1.3. Algorithm and Distributed Implementation 

Algorithm[T]describes the steps to reach a many-to-many sta¬ 
ble matching which is based on the proposed algorithm in |8j. 


Algorithm 1 Stable matching algorithm (8). 

Initialize: i = 0, 7j? = T, for all k £ 1Z, Lsm = 0 

i: repeat 

2: for k £ 1Z do 

3: Receiver k applies to C r k x (T k ) with no transmitter 

in T k has rejected him before. 

4 : Lsm = Lsm + 1 ; 

5: for j £ T do 

6: Transmitter j accepts Cjf ( V *) with no receiver in 

v; = {scn\j£c?(w, vfc e 5} G 2 ) 

it rejected before. 

7: Transmitter j rejects Vj \ Cjf : (Vj ). 

8: Lsm = Lsm + 1 ; 

9: i = i + 1; 

10: until no receiver applies to any transmitter 


In Step 3, each receiver “applies” to its most preferred trans¬ 
mitters by sending a message to each of them. This mes¬ 
sage includes the norm of the interference channel from the 
transmitter to the applying receiver which is needed for the 
transmitter’s optimization problem in (|7j. The overhead for 
sending this message is assumed to be a single symbol inter¬ 
val (Step 4). On receiving all applications, each transmitter 
accepts the receivers based on its optimization problem in 0 
and rejects the rest. Here, the binary decisions are communi¬ 
cated to the applying receivers using a single symbol interval. 

During the algorithm, the receivers apply only to the 
transmitters which have not rejected them before. The trans¬ 
mitters, upon obtaining the applications from the receivers, 
choose the ones they prefer most and which they have not 
rejected before. Algorithm [I] is guaranteed to converge to a 
stable matching [j8j Proposition 5]. Moreover, the algorithm’s 
complexity is fairly low given that each receiver applies at 
most one time to any transmitter {8] Proposition 3]. 

4. NUMERICAL RESULTS 

We consider 25 transmitter-receiver pairs. The transmitters 
are uniformly distributed in a 250 x 250 m 2 region, and each 
receiver is randomly located at a distance of 50 m from its 
transmitter. We set the number of antennas as Nk = Mj = 5 
for all j, k, and the number of data streams per user is two. We 
assume Rayleigh fading such that Hjk ~ CN{ 0, 1) for all 
j, k £ 1C. Let the distance between transmitter j and receiver 
k be a,jk. The pathloss coefficient is modeled as 7 jk = a~ k . 
The coherence block is assumed T = 10 4 symbol intervals. 

Utlizing Theorem [2] to determine the quotas, we let the 
desired pre-log factor to be decreasing in SNR == Kt)/^ 2 
as /3 = a(SNR)/3 mm + (1 — a(SNR))/3 max , where we in¬ 
troduce the function a(SNR) = log(l + SNR/(l + SNR)). 
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Fig. 1. Performance of different CSI-T selection schemes. 


In Fig. |T] the average sum spectral efficiency in ([2]) is plot¬ 
ted for different CSI selection schemes. The average is taken 
over 250 random system deployments. The overhead in CSI 
training corresponding to the plots in Fig.|T]is shown in Fig. [2] 
In the minimal CSI-T sharing scheme, the transmitters only 
have CSI-T regarding the direct channels. Precoding matri¬ 
ces are then calculated according to the eigenvectors of the 
direct channels. In the full CSI-T sharing scheme, complete 
CSI-T is present at all transmitters. This scheme requires the 
largest amount of overhead in the F phase (Sec. 2.2.1) as is 
shown in Fig. [2] For calculating the transceivers, we use the 
alternating optimization algorithm in |4| in which we fix the 
maximum number of iterations to five. It can be seen in Fig. [T] 
that the performance of minimal CSI-T sharing is relatively 
high at low SNR while at high SNR interference management 
is necessary despite a high overhead in CSI-T acquisition. 

The stable matching CSI-T selection scheme corresponds 
to Algorithm [T] After the F phase, the algorithm from |4| is 
applied at the transmitters using the possibly incomplete CSI- 
T. The performance of the stable matching scheme is shown 
to outperform minimal and full CSI-T sharing. By neglect¬ 
ing the induced overhead, we compare our distributed algo¬ 
rithm with the centralized greedy selection scheme proposed 
in 03 for determining the users which perform interference 
alignment in partially connected networks. It can be observed 
that the performances of the two algorithms are very similar, 
where we have used our novel model for the quotas for both 
algorithms. When consider the actual overhead ijif, it can 
be seen that the incurred performance loss is very small in 
stable matching. This overhead L$m, which is influenced by 
the complexity of the stable matching algorithm, is relatively 
small compared to the overhead in CSI feedback. 
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Fig. 2. Corresponding average data transmission time /3. 
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